Using Finite State Automata for Sequence Mining
Abstract
We show how frequently occurring sequential patterns may be found from large datasets by first inducing a finite state automaton model describing the data, and then querying the model.
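The induce-then-query idea can be illustrated with a minimal sketch. The abstract does not say which induction algorithm is used, so everything here is an assumption made for illustration: the model is a simple prefix-tree automaton built over contiguous subsequences of bounded length, each state stores the number of input sequences that reach it, and the names `CountingAutomaton`, `induce`, `frequent_patterns`, `max_len`, and `min_support` are hypothetical.

```python
from collections import defaultdict


class CountingAutomaton:
    """Prefix-tree automaton; each state records how many sequences reach it.

    Illustrative only: this is not the induction method of the paper.
    """

    def __init__(self):
        self.transitions = defaultdict(dict)  # state -> {symbol: next state}
        self.support = defaultdict(int)       # state -> number of sequences reaching it
        self.next_state = 1                   # state 0 is the start state

    def _step(self, state, symbol):
        """Follow the transition on `symbol`, creating it if it is missing."""
        nxt = self.transitions[state].get(symbol)
        if nxt is None:
            nxt = self.next_state
            self.next_state += 1
            self.transitions[state][symbol] = nxt
        return nxt

    def induce(self, sequences, max_len):
        """Build the model from every contiguous window of up to `max_len` symbols."""
        for seq in sequences:
            reached = set()  # states visited by this sequence, counted once each
            for start in range(len(seq)):
                state = 0
                for symbol in seq[start:start + max_len]:
                    state = self._step(state, symbol)
                    reached.add(state)
            for state in reached:
                self.support[state] += 1

    def frequent_patterns(self, min_support):
        """Query the model: return {pattern: support} for patterns meeting `min_support`.

        Support never increases along a path, so infrequent branches are pruned.
        """
        results = {}
        stack = [(0, ())]
        while stack:
            state, prefix = stack.pop()
            for symbol, nxt in self.transitions[state].items():
                if self.support[nxt] >= min_support:
                    pattern = prefix + (symbol,)
                    results[pattern] = self.support[nxt]
                    stack.append((nxt, pattern))
        return results
```

Once the model is built, the data are no longer needed: calling `induce(["abcab", "abd", "cab"], max_len=3)` and then `frequent_patterns(2)` reads the supports directly off the automaton states, and the query can be repeated with a different threshold without rescanning the sequences.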
Similar resources
Reduction of Computational Complexity in Finite State Automata Explosion of Networked System Diagnosis (RESEARCH NOTE)
This research puts forward rough finite state automata represented by two variants of BDD, called ROBDD and ZBDD. The proposed structures have been used in networked system diagnosis and can overcome combinatorial explosion. The implementation uses the CUDD (Colorado University Decision Diagrams) package. A mathematical proof of the claimed complexity is provided, which shows ZBDD ...
Language Independent Transliteration Mining System Using Finite State Automata Framework
We propose a Named Entities transliteration mining system using Finite State Automata (FSA). We compare the proposed approach with a baseline system that utilizes the Editex technique to measure the length-normalized phonetic based edit distance between the two words. We submitted three standard runs in NEWS2010 shared task and ranked first for English to Arabic (WM-EnAr) and obtained an Fmeasu...
Automata Theory Approach for Solving Frequent Pattern Discovery Problems
The various types of frequent pattern discovery problems, namely the frequent itemset, sequence, and graph mining problems, are solved in different ways that are nonetheless similar in certain respects. The main approaches to discovering such patterns fall into two classes: levelwise methods and database projection-based methods. The level-...
A Tool for Runtime Reliability Estimation and Control Using Automata Based Software Reliability Model
The paper presents a novel approach for runtime reliability estimation of executable software. The proposed model does not depend on post-failure data analysis, whose accuracy depends on assumptions about some dubious statistical distribution. Instead, the model captures the required estimates during runtime from state-to-state transitions. Thus, the model computes software ...
Sequential Pattern Mining Using Formal language Tools
In the present scenario, almost every system and process is computerized, and hence all information and data are stored in computers. Huge collections of data are emerging. Retrieving untouched, hidden, and important information from this data is tedious work. Data mining is a technological solution that extracts untouched, hidden, and important information from vast databases...